Medical Acronym Disambiguation Using Online Sources
Authors
Abstract
Hospitals produce millions of patient records whose clinical annotations make extensive use of abbreviations. These annotations are an excellent source of data for bioinformatics research, but the abbreviations can create ambiguity. The main objective of our research is to develop a software application that takes a medical acronym as input and accesses medical and pharmaceutical websites to retrieve information from articles that contain both the acronym and a user-selected full form of it. The retrieved information consists of the article title, authors, publication date, abstract, and Medical Subject Headings (MeSH) details. Researchers in the Biomedical Informatics Division of the Cincinnati Children’s Hospital Medical Center use this information as part of a research effort to reduce the ambiguity created by acronyms. Our contribution to this effort is a framework for disambiguation that accesses online sources; we design and populate an internal database that can be used in future research.
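The workflow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the record fields follow the list in the abstract, but the names Article, build_query, create_store, save_article, and senses_for, the query syntax, and the SQLite table layout are all assumptions made for illustration. Retrieval from the online sources is left out, since the abstract does not say which websites or interfaces are used.

```python
import sqlite3
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Article:
    """One retrieved article; fields mirror those listed in the abstract."""
    title: str
    authors: List[str]
    publication_date: str
    abstract: str
    mesh_terms: List[str] = field(default_factory=list)


def build_query(acronym: str, full_form: str) -> str:
    """Hypothetical query syntax: require both the acronym and its full form."""
    return f'"{acronym}" AND "{full_form}"'


def create_store(path: str = ":memory:") -> sqlite3.Connection:
    """Create the (assumed) internal database with a single articles table."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS articles (
               acronym TEXT, full_form TEXT, title TEXT, authors TEXT,
               publication_date TEXT, abstract TEXT, mesh_terms TEXT)"""
    )
    return conn


def save_article(conn: sqlite3.Connection, acronym: str,
                 full_form: str, article: Article) -> None:
    """Store one article under the acronym and the full form it was found with."""
    conn.execute(
        "INSERT INTO articles VALUES (?, ?, ?, ?, ?, ?, ?)",
        (acronym, full_form, article.title, "; ".join(article.authors),
         article.publication_date, article.abstract,
         "; ".join(article.mesh_terms)),
    )
    conn.commit()


def senses_for(conn: sqlite3.Connection, acronym: str) -> Dict[str, int]:
    """Return each stored full form of the acronym with its article count."""
    rows = conn.execute(
        "SELECT full_form, COUNT(*) FROM articles "
        "WHERE acronym = ? GROUP BY full_form",
        (acronym,),
    )
    return dict(rows.fetchall())
```

Keying each stored row by both the acronym and the selected full form is one plausible way to let later work look up all known senses of an acronym together with the articles that support each one.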
Similar resources
Automated Identification of Synonyms in Biomedical Acronym Sense Inventories
Acronyms are increasingly prevalent in biomedical text, and the task of acronym disambiguation is fundamentally important for biomedical natural language processing systems. Several groups have generated sense inventories of acronym long form expansions from the biomedical literature. Long form sense inventories, however, may contain conceptually redundant expansions that negatively affect thei...
Acronym Disambiguation Using Word Embedding
According to the website AcronymFinder.com which is one of the world's largest and most comprehensive dictionaries of acronyms, an average of 37 new human-edited acronym definitions are added every day. There are 379,918 acronyms with 4,766,899 definitions on that site up to now, and each acronym has 12.5 definitions on average. It is a very important research topic to identify what exactly an ...
Kernel Methods for Word Sense Disambiguation and Acronym Expansion
The scarcity of manually labeled data for supervised machine learning methods presents a significant limitation on their ability to acquire knowledge. The use of kernels in Support Vector Machines (SVMs) provides an excellent mechanism to introduce prior knowledge into the SVM learners, such as by using unlabeled text or existing ontologies as additional knowledge sources. Our aim is to develop...
A Comparative Study of Supervised Learning as Applied to Acronym Expansion in Clinical Reports
Electronic medical records (EMR) constitute a valuable resource of patient specific information and are increasingly used for clinical practice and research. Acronyms present a challenge to retrieving information from the EMR because many acronyms are ambiguous with respect to their full form. In this paper we perform a comparative study of supervised acronym disambiguation in a corpus of clini...
Semi-Supervised Maximum Entropy Based Approach to Acronym and Abbreviation Normalization in Medical Texts
Text normalization is an important aspect of successful information retrieval from medical documents such as clinical notes, radiology reports and discharge summaries. In the medical domain, a significant part of the general problem of text normalization is abbreviation and acronym disambiguation. Numerous abbreviations are used routinely throughout such texts and knowing their meaning is criti...
Publication date: 2007